Uncertainty-Aware Reinforcement Learning for Collision Avoidance

نویسندگان

Gregory Kahn

Adam Villaflor

Vitchyr Pong

Pieter Abbeel

Sergey Levine

چکیده

Reinforcement learning can enable complex, adaptive behavior to be learned automatically for autonomous robotic platforms. However, practical deployment of reinforcement learning methods must contend with the fact that the training process itself can be unsafe for the robot. In this paper, we consider the specific case of a mobile robot learning to navigate an a priori unknown environment while avoiding collisions. In order to learn collision avoidance, the robot must experience collisions at training time. However, high-speed collisions, even at training time, could damage the robot. A successful learning method must therefore proceed cautiously, experiencing only low-speed collisions until it gains confidence. To this end, we present an uncertainty-aware model-based learning algorithm that estimates the probability of collision together with a statistical estimate of uncertainty. By formulating an uncertainty-dependent cost function, we show that the algorithm naturally chooses to proceed cautiously in unfamiliar environments, and increases the velocity of the robot in settings where it has high confidence. Our predictive model is based on bootstrapped neural networks using dropout, allowing it to process raw sensory inputs from high-bandwidth sensors such as cameras. Our experimental evaluation demonstrates that our method effectively minimizes dangerous collisions at training time in an obstacle avoidance task for a simulated and real-world quadrotor, and a realworld RC car. Videos of the experiments can be found at https://sites.google.com/site/probcoll.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

Uncertainty Models for TTC-Based Collision-Avoidance

We address the problem of uncertainty-aware local collision avoidance within the context of time-to-collision based navigation of multiple agents. We consider two specific models that account for uncertainty in the future trajectories of interacting agents: an isotropic model which conservatively considers all possible errors, and an adversarial model that assumes the error is towards a head-on...

متن کامل

Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

Developing a safe and efficient collision avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generate its paths without observing other robots’ states and intents. While other distributed multirobot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computation...

متن کامل

Combining Deep Reinforcement Learning and Safety Based Control for Autonomous Driving

With the development of state-of-art deep reinforcement learning, we can efficiently tackle continuous control problems. But the deep reinforcement learning method for continuous control is based on historical data, which would make unpredicted decisions in unfamiliar scenarios. Combining deep reinforcement learning and safety based control can get good performance for self-driving and collisio...

متن کامل

Differential Adaptive Stress Testing of Airborne Collision Avoidance Systems

The next-generation Airborne Collision Avoidance System (ACAS X) is currently being developed and tested to replace the Traffic Alert and Collision Avoidance System (TCAS) as the next international standard for collision avoidance. To validate the safety of the system, stress testing in simulation is one of several approaches for analyzing nearmid-air collisions (NMACs). Understanding how NMACs...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1702.01182 شماره

صفحات -

تاریخ انتشار 2010

Uncertainty-Aware Reinforcement Learning for Collision Avoidance

نویسندگان

چکیده

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

Uncertainty Models for TTC-Based Collision-Avoidance

Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

Combining Deep Reinforcement Learning and Safety Based Control for Autonomous Driving

Differential Adaptive Stress Testing of Airborne Collision Avoidance Systems

عنوان ژورنال:

اشتراک گذاری